Automatic Text Summarization Using Lexical Clustering
Authors
Abstract
The goal of automatic text summarization is to reduce the size of a document while preserving its content. We investigate a summarization method that uses not only statistical features but also the contextual meaning of documents, captured through lexical clustering. We present a new method for computing lexical clusters in a text without high-cost knowledge resources such as the WordNet thesaurus. Summarization proceeds in five steps: the words of a document are vectorized, lexical clusters are constructed, topical clusters are identified, representative words of the document are selected, and a summary is produced using a query. Compared with other methods, we achieved better performance in experiments with 30% summaries, 10% summaries, and fixed four-sentence summaries.
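The five steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the vectorization, the clustering rule, or how the query is built, so every concrete choice below is an assumption made for illustration. Here words are vectorized by the sentences they occur in, clusters are formed greedily by cosine similarity against a cluster's first word, the topical cluster is the one with the highest total word frequency, and the query is that cluster's most frequent words. The names `summarize`, `threshold`, and `n_query_words` are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict


def split_sentences(text):
    # Naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    return re.findall(r"[a-z]+", sentence.lower())


def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dict-like counters.
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def summarize(text, n_sentences=1, threshold=0.5, n_query_words=3,
              stopwords=frozenset()):
    sentences = split_sentences(text)
    tokens = [[w for w in tokenize(s) if w not in stopwords] for s in sentences]

    # Step 1 (assumed form): vectorize each word by the sentences it occurs in.
    vectors = defaultdict(Counter)
    for i, words in enumerate(tokens):
        for w in words:
            vectors[w][i] += 1
    freq = Counter(w for words in tokens for w in words)

    # Step 2 (assumed form): greedy clustering, most frequent words first.
    # A word joins the first cluster whose seed word is similar enough.
    clusters = []
    for w, _ in freq.most_common():
        for cluster in clusters:
            if cosine(vectors[w], vectors[cluster[0]]) >= threshold:
                cluster.append(w)
                break
        else:
            clusters.append([w])
    if not clusters:
        return []

    # Step 3 (assumed form): the topical cluster has the highest total frequency.
    topical = max(clusters, key=lambda c: sum(freq[w] for w in c))

    # Step 4: representative words are the most frequent words of that cluster.
    query = set(sorted(topical, key=lambda w: -freq[w])[:n_query_words])

    # Step 5: score sentences by overlap with the query, keep the best ones,
    # and return them in their original document order.
    scores = [len(query & set(words)) for words in tokens]
    best = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    return [sentences[i] for i in sorted(best[:n_sentences])]
```

Setting `n_sentences` to a fraction of the document's sentence count would approximate the percentage-based summaries mentioned in the abstract, while a fixed value corresponds to the fixed-length setting.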
Similar resources
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to sift through such a massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Using Genetic Algorithms with Lexical Chains for Automatic Text Summarization
Automatic text summarization takes an input text and extracts the most important content in the text. Determining the importance of information depends on several factors. In this paper, we combine two different approaches that have been used in the text summarization domain. The first one is using genetic algorithms to learn the patterns in the documents that lead to the summaries. The other o...
Automatic Knowledge Representation Using A Graph-Based Algorithm For Language-Independent Lexical Chaining
Lexical Chains are powerful representations of documents. In particular, they have successfully been used in the field of Automatic Text Summarization. However, until now, Lexical Chaining algorithms have only been proposed for English. In this paper, we propose a greedy Language-Independent algorithm that automatically extracts Lexical Chains from texts. For that purpose, we build a hierarchic...
Computing Lexical Chains for Automatic Arabic Text Summarization
Automatic Text Summarization has received a great deal of attention in the past couple of decades. It has gained a lot of interest especially with the proliferation of the Internet and the new technologies. Arabic as a language still lacks research in the field of Information Retrieval. In this paper, we explore lexical cohesion using lexical chains for an extractive summarization system for Ar...
Computing Lexical Chains with Graph Clustering
This paper describes a new method for computing lexical chains. These are sequences of semantically related words that reflect a text’s cohesive structure. In contrast to previous methods, we are able to select chains based on their cohesive strength. This is achieved by analyzing the connectivity in graphs representing the lexical chains. We show that the generated chains significantly improve...